


Creators/Authors contains: "Gaither, Michelle R."


  1. Genomic data are being produced and archived at a prodigious rate, and current studies could become historical baselines for future global genetic diversity analyses and monitoring programs. However, when we evaluated the potential utility of genomic data from wild and domesticated eukaryote species in the world’s largest genomic data repository, we found that most archived genomic datasets (87%) lacked the spatiotemporal metadata necessary for genetic biodiversity surveillance. Labor-intensive scouring of a subset of published papers yielded geospatial coordinates and collection years for only 39% (51% if place names were considered) of these genomic datasets. Streamlined data input processes, updated metadata deposition policies, and enhanced scientific community awareness are urgently needed to preserve these irreplaceable records of today’s genetic biodiversity and to plug the growing metadata gap. 
  2. The banded coral shrimp, Stenopus hispidus (Crustacea: Decapoda: Stenopodidea), is a popular marine ornamental species with a circumtropical distribution. The planktonic larval stage lasts ∼120–253 days, indicating considerable dispersal potential, but few studies have investigated genetic connectivity on a global scale in marine invertebrates. To resolve patterns of divergence and phylogeography of S. hispidus, we surveyed 525 bp of mitochondrial cytochrome c oxidase subunit I (COI) from 198 individuals sampled at 10 locations across ∼27,000 km of the species range. Phylogenetic analyses reveal that S. hispidus has a Western Atlantic lineage and a widely distributed Indo-Pacific lineage, separated by sequence divergence of 2.1%. Genetic diversity is much higher in the Western Atlantic (h = 0.929; π = 0.004) relative to the Indo-Pacific (h = 0.105; π < 0.001), and coalescent analyses indicate that the Indo-Pacific population expanded more recently (95% HPD (highest posterior density) = 60,000–400,000 yr) than the Western Atlantic population (95% HPD = 300,000–760,000 yr). Divergence of the Western Atlantic and Pacific lineages is estimated at 710,000–1.8 million years ago, which does not readily align with commonly implicated colonization events between the ocean basins. The estimated age of populations contradicts the prevailing dispersal route for tropical marine biodiversity (Indo-Pacific to Atlantic) with the oldest and most diverse population in the Atlantic, and a recent population expansion with a single common haplotype shared throughout the vast Indian and Pacific oceans. In contrast to the circumtropical fishes, this diminutive reef shrimp challenges our understanding of conventional dispersal capabilities of marine species.
  3. Abstract

    Genetic diversity within species represents a fundamental yet underappreciated level of biodiversity. Because genetic diversity can indicate species resilience to changing climate, its measurement is relevant to many national and global conservation policy targets. Many studies produce large amounts of genome‐scale genetic diversity data for wild populations, but most (87%) do not include the associated spatial and temporal metadata necessary for them to be reused in monitoring programs or for acknowledging the sovereignty of nations or Indigenous peoples. We undertook a distributed datathon to quantify the availability of these missing metadata and to test the hypothesis that their availability decays with time. We also worked to remediate missing metadata by extracting them from associated published papers, online repositories, and direct communication with authors. Starting with 848 candidate genomic data sets (reduced representation and whole genome) from the International Nucleotide Sequence Database Collaboration, we determined that 561 contained mostly samples from wild populations. We successfully restored spatiotemporal metadata for 78% of these 561 data sets (n = 440 data sets with data on 45,105 individuals from 762 species in 17 phyla). Examining papers and online repositories was much more fruitful than contacting 351 authors, who replied to our email requests 45% of the time. Overall, 23% of our email queries to authors unearthed useful metadata. The probability of retrieving spatiotemporal metadata declined significantly as age of the data set increased. There was a 13.5% yearly decrease in metadata associated with published papers or online repositories and up to a 22% yearly decrease in metadata that were only available from authors. This rapid decay in metadata availability, mirrored in studies of other types of biological data, should motivate swift updates to data‐sharing policies and researcher practices to ensure that the valuable context provided by metadata is not lost to conservation science forever.

  4. Abstract

    Genetic data represent a relatively new frontier for our understanding of global biodiversity. Ideally, such data should include both organismal DNA‐based genotypes and the ecological context where the organisms were sampled. Yet most tools and standards for data deposition focus exclusively either on genetic or ecological attributes. The Genomic Observatories Metadatabase (GEOME: geome‐db.org) provides an intuitive solution for maintaining links between genetic data sets stored by the International Nucleotide Sequence Database Collaboration (INSDC) and their associated ecological metadata. GEOME facilitates the deposition of raw genetic data to INSDC's Sequence Read Archive (SRA) while maintaining persistent links to standards‐compliant ecological metadata held in the GEOME database. This approach facilitates findable, accessible, interoperable and reusable data archival practices. Moreover, GEOME enables data management solutions for large collaborative groups and expedites batch retrieval of genetic data from the SRA. The article that follows describes how GEOME can enable genuinely open data workflows for researchers in the field of molecular ecology.

  5. Abstract

    Genetic structure within marine species may be driven by local adaptation to their environment, or alternatively by historical processes, such as geographic isolation. The gulfs and seas bordering the Arabian Peninsula offer an ideal setting to examine connectivity patterns in coral reef fishes with respect to environmental gradients and vicariance. The Red Sea is characterized by a unique marine fauna, historical periods of desiccation and isolation, as well as environmental gradients in salinity, temperature, and primary productivity that vary both by latitude and by season. The adjacent Arabian Sea is characterized by a sharper environmental gradient, ranging from extensive coral cover and warm temperatures in the southwest, to sparse coral cover, cooler temperatures, and seasonal upwelling in the northeast. Reef fish, however, are not confined to these seas, with some Red Sea fishes extending varying distances into the northern Arabian Sea, while their pelagic larvae are presumably capable of much greater dispersal. These species must therefore cope with a diversity of conditions that invoke the possibility of steep clines in natural selection. Here, we test for genetic structure in two widespread reef fish species (a butterflyfish and surgeonfish) and eight range‐restricted butterflyfishes across the Red Sea and Arabian Sea using genome‐wide single nucleotide polymorphisms. We performed multiple matrix regression with randomization analyses on genetic distances for all species, as well as reconstructed scenarios for population subdivision in the species with signatures of isolation. We found that (a) widespread species displayed more genetic subdivision than regional endemics and (b) this genetic structure was not correlated with contemporary environmental parameters but instead may reflect historical events. We propose that the endemic species may be adapted to a diversity of local conditions, but the widespread species are instead subject to ecological filtering where different combinations of genotypes persist under divergent ecological regimes.

  6. Abstract

    Aim

    To test hypothesized biogeographic partitions of the tropical Indo‐Pacific Ocean with phylogeographic data from 56 taxa, and to evaluate the strength and nature of barriers emerging from this test.

    Location

    The Indo‐Pacific Ocean.

    Time period

    Pliocene through the Holocene.

    Major taxa studied

    Fifty‐six marine species.

    Methods

    We tested eight biogeographic hypotheses for partitioning of the Indo‐Pacific using a novel modification to analysis of molecular variance. Putative barriers to gene flow emerging from this analysis were evaluated for pairwise ΦST, and these ΦST distributions were compared to distributions from randomized datasets and simple coalescent simulations of vicariance arising from the Last Glacial Maximum. We then weighed the relative contribution of distance versus environmental or geographic barriers to pairwise ΦST with a distance‐based redundancy analysis (dbRDA).

    Results

    We observed a diversity of outcomes, although the majority of species fit a few broad biogeographic regions. Repeated coalescent simulation of a simple vicariance model yielded a wide distribution of pairwise ΦST that was very similar to empirical distributions observed across five putative barriers to gene flow. Three of these barriers had median ΦST that were significantly larger than random expectation. Only 21 of 52 species analysed with dbRDA rejected the null model. Among these, 15 had overwater distance as a significant predictor of pairwise ΦST, while 11 were significant for geographic or environmental barriers other than distance.

    Main conclusions

    Although there is support for three previously described barriers, phylogeographic discordance in the Indo‐Pacific Ocean indicates incongruity between processes shaping the distributions of diversity at the species and population levels. Among the many possible causes of this incongruity, genetic drift provides the most compelling explanation: given massive effective population sizes of Indo‐Pacific species, even hard vicariance for tens of thousands of years can yield ΦST values that range from 0 to nearly 0.5.
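
Illustrative note on record 2: the haplotype diversity statistic h reported for Stenopus hispidus (0.929 in the Western Atlantic versus 0.105 in the Indo-Pacific) is commonly computed with Nei's unbiased estimator, h = n/(n - 1) × (1 - Σ p_i²), where p_i is the sample frequency of haplotype i and n is the number of sequences. The abstract does not state which estimator or software the authors used, so the sketch below is only an assumption about the calculation. The function name and both sample lists are hypothetical and are not data from the study.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity: n/(n-1) * (1 - sum(p_i^2)).

    `haplotypes` is a list with one haplotype label per sampled individual.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled sequences")
    counts = Counter(haplotypes)
    # Sum of squared haplotype frequencies (probability two draws match).
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical samples, NOT data from the study:
# one haplotype dominates (19 of 20 individuals share it).
one_dominant = ["A"] * 19 + ["B"]
# ten haplotypes, each carried by two of 20 individuals.
many_distinct = list("ABCDEFGHIJ") * 2

print(f"{haplotype_diversity(one_dominant):.3f}")   # 0.100
print(f"{haplotype_diversity(many_distinct):.3f}")  # 0.947
```

The contrast between the two toy samples mirrors the pattern described in the abstract: a population dominated by a single common haplotype yields h near 0.1, while a population with many haplotypes at similar frequencies yields h above 0.9.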
